A Novel Approach to Finding Near-Cliques: The Triangle-Densest Subgraph Problem
نویسنده
چکیده
Many graph mining applications rely on detecting subgraphs which are large near-cliques. There exists a dichotomy between the results in the existing work related to this problem: on the one hand formulations that are geared towards finding large near-cliques are NP-hard and frequently inapproximable due to connections with the Maximum Clique problem. On the other hand, the densest subgraph problem (DS-Problem) which maximizes the average degree over all subgraphs and other indirect approaches which optimize tractable objectives fail to detect large near-cliques in many networks. In this work, we propose a formulation which combines the best of both worlds: it is solvable in polynomial time and succeeds consistently in finding large near-cliques. Surprisingly, our formulation is a simple variation of the DS-Problem. Specifically, we define the triangle densest subgraph problem (TDS-Problem): given a graph G(V,E), find a subset of vertices S∗ such that τ(S∗) = max S⊆V t(S) |S| , where t(S) is the number of triangles induced by the set S. We provide various exact and approximation algorithms which the solve TDS-Problem efficiently. Furthermore, we show how our algorithms adapt to the more general problem of maximizing the k-clique average density, k ≥ 2. We illustrate the success of the proposed formulation in extracting large near-cliques from graphs by performing numerous experiments on real-world networks.
منابع مشابه
Approximation schemes, cliques, colors and densest subgraphs
In this thesis we study the problem of finding the densest k-subgraph of a given graph G = (V, E). We present algorithms of polynomial time as well as approximation results on special graph classes. Analytically, we study polynomial time algorithms for the densest k-subgraph problem on weighted graphs of maximal degree two, on weighted trees even if the solution is disconnected, and on interval...
متن کاملFinding Dense Subgraphs with Size Bounds
We consider the problem of finding dense subgraphs with specified upper or lower bounds on the number of vertices. We introduce two optimization problems: the densest at-least-k-subgraph problem (dalks), which is to find an induced subgraph of highest average degree among all subgraphs with at least k vertices, and the densest at-most-k-subgraph problem (damks), which is defined similarly. Thes...
متن کاملOn the densest k-subgraph problem
Given an n-vertex graph G and a parameter k, we are to nd a k-vertex subgraph with the maximum number of edges. This problem is N P-hard. We show that the problem remains N P-hard even when the maximum degree in G is three. When G contains a k-clique, we give an algorithm that for any > 0 nds a k-vertex subgraph with at least (1 ?) ? k 2 edges, in time n O((1+log n k)==). We study the applicabi...
متن کاملNP-hardness results for partitioning graphs into disjoint cliques and a triangle-free subgraph
This paper investigates the computational complexity of deciding whether the vertices of a graph can be partitioned into a disjoint union of cliques and a triangle-free subgraph. This problem is known to be NP-complete on arbitrary graphs. We show that this problem remains NP-complete even when restricted to planar graphs and perfect graphs.
متن کاملThe densest k-subgraph problem on clique graphs
The Densest k-Subgraph (DkS) problem asks for a k-vertex subgraph of a given graph with the maximum number of edges. The problem is strongly NPhard, as a generalization of the well known Clique problem and we also know that it does not admit a Polynomial Time Approximation Scheme (PTAS). In this paper we focus on special cases of the problem, with respect to the class of the input graph. Especi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1405.1477 شماره
صفحات -
تاریخ انتشار 2014